NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MetaAgents: Large Language Model Based Agents for Decision-Making on Teaming

https://doi.org/10.1145/3711032

Li, Yuan; Sun, Lichao; Zhang, Yixuan (April 2025, Proceedings of the ACM on Human-Computer Interaction)

Significant advancements have occurred in the application of Large Language Models (LLMs) for social simulations. Despite this, their abilities to perform teaming in task-oriented social events are underexplored. Such capabilities are crucial if LLMs are to effectively mimic human-like social behaviors and form efficient teams to solve tasks. To bridge this gap, we introduce MetaAgents, a social simulation framework populated with LLM-based agents. MetaAgents facilitates agent engagement in conversations and a series of decision making within social contexts, serving as an appropriate platform for investigating interactions and interpersonal decision-making of agents. In particular, we construct a job fair environment as a case study to scrutinize the team assembly and skill-matching behaviors of LLM-based agents. We take advantage of both quantitative metrics evaluation and qualitative text analysis to assess their teaming abilities at the job fair. Our evaluation demonstrates that LLM-based agents perform competently in making rational decisions to develop efficient teams. However, we also identify limitations that hinder their effectiveness in more complex team assembly tasks. Our work provides valuable insights into the role and evolution of LLMs in task-oriented social simulations.
more » « less
Full Text Available
HarmonyCloak: Making Music Unlearnable for Generative AI

https://doi.org/10.1109/SP61157.2025.00085

Ali_Meerza, Syed Irfan; Sun, Lichao; Liu, Jian (May 2025, IEEE)

Full Text Available
TinyGPT-MoE: Scaling Multi-modal Large Language Model via Advanced Vision Encoder with Mixture-of-Experts

Yuan, Zhengqing; Wang, Yang; Li, Zhaoxu; Ye, Yanfang; Sun, Lichao (December 2024, Workshop on Advancing Neural Network Training at International Conference on Machine Learning (WANT@ICML))

Full Text Available
FedCAP: Robust Federated Learning via Customized Aggregation and Personalization

https://doi.org/10.1109/ACSAC63791.2024.00067

Li, Youpeng; Wang, Xinda; Yu, Fuxun; Sun, Lichao; Zhang, Wenbin; Wang, Xuyu (December 2024, IEEE)

Full Text Available
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Yuan, Zhengqing; Wang, Yang; Li, Zhaoxu; Ye, Yanfang; Sun, Lichao (November 2024, Proceedings of Machine Learning Research)

Full Text Available
GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations

Duan, Jinhao; Zhang, Renming; Diffenderfer, James; Kailkhura, Bhavya; Sun, Lichao; Stengel-Eskin, Elias; Bansal, Mohit; Chen, Tianlong; Xu, Kaidi (December 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

As Large Language Models (LLMs) are integrated into critical real-world applications, their strategic and logical reasoning abilities are increasingly crucial. This paper evaluates LLMs' reasoning abilities in competitive environments through game-theoretic tasks, e.g., board and card games that require pure logic and strategic reasoning to compete with opponents. We first propose GTBench, a language-driven environment composing 10 widely-recognized tasks, across a comprehensive game taxonomy: complete versus incomplete information, dynamic versus static, and probabilistic versus deterministic scenarios. Then, we (1) Characterize the game-theoretic reasoning of LLMs; and (2) Perform LLM-vs.-LLM competitions as reasoning evaluation. We observe that (1) LLMs have distinct behaviors regarding various gaming scenarios; for example, LLMs fail in complete and deterministic games yet they are competitive in probabilistic gaming scenarios; (2) Most open-source LLMs, e.g., CodeLlama-34b-Instruct and Llama-2-70b-chat, are less competitive than commercial LLMs, e.g., GPT-4, in complex games, yet the recently released Llama-3-70b-Instruct makes up for this shortcoming. In addition, code-pretraining greatly benefits strategic reasoning, while advanced reasoning methods such as Chain-of-Thought (CoT) and Tree-of-Thought (ToT) do not always help. We further characterize the game-theoretic properties of LLMs, such as equilibrium and Pareto Efficiency in repeated games. Detailed error profiles are provided for a better understanding of LLMs' behavior. We hope our research provides standardized protocols and serves as a foundation to spur further explorations in the strategic reasoning of LLMs.
more » « less
Full Text Available
TinyGPT-MoE: Scaling Multi-modal Large Language Model via Advanced Vision Encoder with Mixture-of-Experts

Yuan, Zhengqing; Wang, Yang; Li, Zhaoxu; Ye, Yanfang; Sun, Lichao (July 2024, Proceedings of Machine Learning Research)

Full Text Available
Stable Unlearnable Example: Enhancing the Robustness of Unlearnable Examples via Stable Error-Minimizing Noise

https://doi.org/10.1609/aaai.v38i4.28169

Liu, Yixin; Xu, Kaidi; Chen, Xun; Sun, Lichao (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

The open sourcing of large amounts of image data promotes the development of deep learning techniques. Along with this comes the privacy risk of these image datasets being exploited by unauthorized third parties to train deep learning models for commercial or illegal purposes. To avoid the abuse of data, a poisoning-based technique, unlearnable example, has been proposed to significantly degrade the generalization performance of models by adding imperceptible noise to the data. To further enhance its robustness against adversarial training, existing works leverage iterative adversarial training on both the defensive noise and the surrogate model. However, it still remains unknown whether the robustness of unlearnable examples primarily comes from the effect of enhancement in the surrogate model or the defensive noise. Observing that simply removing the adversarial perturbation on the training process of the defensive noise can improve the performance of robust unlearnable examples, we identify that solely the surrogate model's robustness contributes to the performance. Furthermore, we found a negative correlation exists between the robustness of defensive noise and the protection performance, indicating defensive noise's instability issue. Motivated by this, to further boost the robust unlearnable example, we introduce Stable Error-Minimizing noise (SEM), which trains the defensive noise against random perturbation instead of the time-consuming adversarial perturbation to improve the stability of defensive noise. Through comprehensive experiments, we demonstrate that SEM achieves a new state-of-the-art performance on CIFAR-10, CIFAR-100, and ImageNet Subset regarding both effectiveness and efficiency.
more » « less
Full Text Available
Attacking Neural Networks with Neural Networks: Towards Deep Synchronization for Backdoor Attacks

https://doi.org/10.1145/3583780.3614784

Guan, Zihan; Sun, Lichao; Du, Mengnan; Liu, Ninghao (October 2023, ACM)

Full Text Available
Decentralized Federated Learning: A Survey and Perspective

https://doi.org/10.1109/JIOT.2024.3407584

Yuan, Liangqi; Wang, Ziran; Sun, Lichao; Yu, Philip S; Brinton, Christopher G (January 2024, IEEE Internet of Things Journal)

Full Text Available

« Prev Next »

Search for: All records